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Abstract 

To combine the power efficiency of Continuous Phase Modulation (CPM) with enhanced performance in fading 
environments, some authors have suggested to use CPM in combination with Space-Time Codes (STC). Recently, 
we have proposed a CPM ST-coding scheme based on L^-orthogonality for two transmitting antennas. In this paper 
we extend this approach to the three antennas case. We analytically derive a family of coding schemes which we 
call Parallel Code (PC). This code family has full rate and we prove that the proposed coding scheme achieves full 
diversity as confirmed by accompanying simulations. We detail an example of the proposed ST codes that can be 
interpreted as a conventional CPM scheme with different alphabet sets for the different transmit antennas which 
results in a simplified implementation. Thanks to -orthogonality, the decoding complexity, usually exponentially 
proportional to the number of transmitting antennas, is reduced to linear complexity. 

I. INTRODUCTION 



TO overcome the reduction of channel capacity caused by fading, Telatar [1], Foschini and Gans [2] 
^ described in the late 90s the potential gain of switching to multiple input multiple output (MIMO) 

systems. These results triggered many advances mostly concentrated on the coding aspects for transmitting 
antennas, e.g. Alamouti [3] and Tarokh et al. [4] for Space-Time Block Codes (STBC) and also Tarokh 
I — ^ et al. [5] for Space-Time Trellis Codes. 
^ Zhang and Fitz [6], [7] were the first to apply the idea of STC to CPM by constructing trellis codes. 
> In [8], Silvester et al. derived a diagonal block ST-code which enables non-coherent detection. In [9], 
^ BokolamuUa and Aulin described a concatenation approach to the construction of STC for CPM. A 
condition for optimal coding gain while sustaining full diversity was also recently derived by Zajic and 
<N Stiiber [10]. 

CN Inspired by orthogonal design codes, Wang and Xia introduced in [11] the first orthogonal STC for two 
^ transmitting antennas and full response CPM and later in [12] for partial response. Their approach was 
O extended in [13] to construct a pseudo-orthogonal ST-coded CPM for four antennas. To avoid the structural 
j>! ' limitation of orthogonal design, we proposed in [14], [15] a STC CPM scheme based on L^-orthogonality 
for two antennas. Sufficient conditions for orthogonality were described, L^-orthogonal codes were 
introduced and the simulation results displayed good performance and full rate. Here, motivated by this 
5^ .results, we extend our previous work and generalize these conditions for three transmitting antennas. 

The main result of the three transmit antennas case, is that it can, unlike the codes based on orthogonal 
design, achieve full diversity with a full rate code: 

1) the full rate property is one of the main advantage of using the norm criterion, instead of 
merely extending the classical Tarokh [4] orthogonal design to the CPM case. Indeed, in the classical 
orthogonal design approach, which is based on optimal decoding for linear modulations, the criterion 
is expressed as the orthogonality between matrices of elements, each of these elements being a 
definite integral (usually the output of a matched filter). On the contrary, in the design approach 
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used for non-linear modulations, the product in Eq. ([8]) is a definite integral itself, the integrand 
being the product of two signals. This allows more degrees of freedom and enables the full rate 
property. 

2) the full diversity property can be proved in a similar way to the classical case [7], with the help 
of the extensions proposed by Zajic and Stiiber [10]. 

Furthermore, it should be pointed out that the proposed coding scheme does not limit any parameter 
of the CPM. It is applicable to full and partial response CPM as well as to all modulation indexes. 

We first give the system model for a multiple input multiple output (MIMO) system with Lt transmitting 
(Tx) antennas and Lr receiving (Rx) antennas (Fig.[il). Later on, we will use this general model to derive 
an L2-0STC for CPM for Lt = 3. The emitted signals s(t) are mixed by a channel matrix A of dimension 
Lr X Lt. The elements of A, an,m, are Rayleigh distributed random variables and characterize the fading 
between the n*^ Rx and the m*'^ Tx antenna. The Tx signal is disturbed by complex additive white 
Gaussian noise (AWGN) with variance of 1/2 per dimension which is represented by a x Lt matrix 
n(t). The received signal 

y(t) = As(t)+n(t). (1) 



has elements yn,r and dimension Lr x Lt. We group the transmitted CPM signals into blocks 

'Sl,l{t) ... Si^lM' 



(2) 



similar to an ST block code with the difference that now the elements are functions of time and not 
constant anymore. The elements of Eq. ^ are given by 



\t) = \j exp (j 27r (t)) (3) 

for {Ltl + r — 1)T <t< (Ltl + r)T and m,r = 1,2, Lt. Here m represents the transmitting antenna 
and r the relative time slot in the block. The symbol energy Eg is normalized to the number of Tx antennas 
Lt and the symbol length T. The continuous phase 

7 

<PmAt) = ^m{Ltl + r) + hJ2 dtk(t - ^'T) + CmAt) (4) 

1=1 

is defined similarly to [16] with an additional correction factor Cm,,^) detailed in Section ITl-CI Further- 
more, / is indexing the whole code block, i the overlapping symbols for partial response and i' = Ltl+r — i. 
With this extensive description of the symbol dm% we are able to define all possible mapping schemes 
(cp. Section Hl-BI) . The modulation index h = 2mo/p is the quotient of two relative prime integers mg 
and p and the phase smoothing function q{t) has to be continuous for < t < 7T, for t < and 
1/2 < 'jT. The memory length 7 gives the number of overlapping symbols. 
To maintain continuity of phase, we define the phase memory 

OmiLtl + r + 1) = 9ra{Ltl + r) + UiLtl + r) (5) 

in a general way. The function ^ {Ltl + r) will be fully defined in Section III-CI from the contribution of 
Cmrit)- For a conventional CPM system, we have Cmrit) = and ^(2/ + 1) = |c?2;+i-7. 

In Section [III we derive the L^ conditions for a CPM with three transmitting antennas and introduce 
adequate mappings and a family of correction factors. In Section Unl we detail some properties of the 
code. In Section |lVl we benchmark the code by running some simulations and finally, in Section |Vl some 
conclusions are drawn. 
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Fig. 1. Structure of a MIMO Tx/Rx system 



II. Parallel Codes (PC) for 3 antennas 

A. Orthogonality 

In this section we describe how to enforce orthogonality on CPM systems with three transmitting 
antennas. Similarly to [15], we impose orthogonality by 

(3«+3)T 

j s{t)s^{t)dt = Esl (6) 

where I is the 3x3 identity matrix. Hence the correlation between two different Tx antennas Sm,r{'t) and 
Sm',r{t) is canceled over a complete STC block if 

{3l+3)T 



31T 

with m =^ m' . Now, by using Eq. (|3]) and dH) we get 

3 (3«+r)T ^ 

= I exp (j27r ■ [^^^(3/ + r) + /i ^ ^(^'^^(t - I'T) + c^,,(t) 

9 1 7 — 1 

(3Z+r-l)r 

- ^w(3/ + r) - /i ^ d5*ig(t - t'T) - c^.,,(t)] j dt. (8) 

i=l 

The phase memory 6'm(3/ + r) is time independent and therewith can be moved to a constant factor in 
front of the integrals. Similarly to [15], we introduce parallel mapping {Sm-r = c^m'^r) '^he data symbols 
and repetitive mapping {cm,r{t) = Cm,r'{t)) for the correction factors. The integral on three time slots can 
then be merged into one time dependent factor. Furthermore, we obtain a second, time independent factor 
from the phase memory. Now, by using Eq. (|5]) one can see that the condition from Eq. ^ is fulfilled if 

= 1 + exp(jai) + exp(jai) exp(ja2) (9) 

where = 27r [^^(3/ + r) —^m'i'^l + r)] and we get — exp(— jai) = 1 + exp(ja2). By splitting this 
equation into imaginary and real parts, we have the following two conditions: 

— 1 =cos(— ai) + cos(a2) (10) 

= sin(— ai) + sin(a2). (11) 

This system has, modulo 2n, two pairs of solutions 

(ai, a2) G {(27r/3, 27r/3), (An/S, An/3)}. (12) 

Hence orthogonality is achieved if ^m{3l + r) — ^m'(3/ + r) = 1/3 or ^m(3/ + r) — ^m.'(3/ + r) = 2/3 
for r = 1, 2 and for all combinations of m and m' with m ^ m' . In order to determine Cm(3/ + r), we 
detail in the following section the exact mapping and the correction factor. 
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Fig. 2. Mapping of thie data sequence to thie data symbols 



B. Mapping 

In this section we describe the mapping of the data sequence dj to the data symbols dm}r of the block 
code (Fig. O. To obtain full rate each code block has to include three new symbols from the data sequence. 
In general, the mapping of the three new symbols has no restriction. However, to construct a mapping 
two criteria are considered: 

• simplification of Eq. ([8]) 

• low complexity of function .^^(3/ + r). 

The first criteria is already determined by using parallel mapping {dmlr = dm^\)- Therewith the mapping 
for the m-dimension (Fig. [2l) is fixed. For the remaining two dimensions we choose a mapping similar to 
conventional CPM. The subsequent data symbols in r-direction are mapped to subsequent symbols from 
the data sequence. Also similarly to conventional CPM we shift this mapping by —i and get 

C. Correction Factor 

The choice of the phase memory and therewith of the function ^m(3/ + r) ensures the continuity of 
phase. If (f)m,ri{Ltl + r)T) = (f)m,r+i{{Ltl + r)T), we always have the desired continuity. Hence, 

7 7 

UiLtl + r) = hJ2 dlllH''^) + C'n,r((3/ + r)T) - hj^ d^n^l+i^li^T) - c„,,+i((3/ + r)T). (14) 

i=l i=l 

With a mapping similar to conventional CPM (Section III-BI) we can simplify the two sums into a single 
term and get for r = 1, 2 

^m(3/ + r) =^d'il+r-'y+l + £^,,.((3/ + r)T) - Cm,r-+l((3/ + r)T). (15) 

As the data symbols are equal on each antenna, the difference between two different ^m(3/ + r) does 
not depend on the data symbol d^^j^r-'y+i- Thus, when choosing parallel mapping, orthogonality only 
depends on the correction factor. 

To fulfill Eq. ^ for all antennas we take 

• for m = 1, m' = 2 

a, = y =27r[ci,,((3/ + r)T) - Ci,,+i((3/ + r)T) - C2,.((3/ + r)T) + C2,.+i((3/ + r)T)], (16) 



• for m = 2, m' = 3 

ar = Y =2vr[c2,.((3/ + r)T) - C2,,+i((3/ + r)T) - C3,.((3/ + r)T) + C3,.+i((3/ + r)T)] (17) 

• and for m = 1, m' = 3 we consequently get 

An 

ar = Y =2vr[ci,,((3/ + r)T) - Ci,,+i((3/ + r)T) - C3,.((3/ + r)T) + C3,.+i((3/ + r)T)]. (18) 

The other three possible combinations of m and m' with m ^ m' lead only to a change of sign and we 
get ar = —2n/3, —2n/3, —Air/S, respectively. Due to the modulo 27r character of our condition, these are 
also valid solutions. 

For simplicity, we assume similar correction factors for each time slot r of one Tx antenna Cm,i(t) = 
Cm,2{t) = Cm,3(i)- Sincc Eq. (fTSi) arises from Eq. (fT6l) and (flTI) . we have two equations and three 
parameters: Ci^r{t), C2,r{t) and c^^r{t)- Hence we define C2,r(i) = for r = 1,2,3 and we get Ci^r((3/ + 
r)T) - ci,,.+i((3/ + r)T) = 1/3 and C3,r((3/ + r)T) - C3,r+i((3/ + r)T) = -1/3 for r = 1, 2. The codes 
satisfying these conditions are coined Parallel Codes (PC). We will now describe some possible solutions 
of this type. 

An obvious solution for the correction factor is obtained for all functions which are for t = (3/ + r)T 
and ±1/3 for t = (3/ + r + 1)T, e.g. 

ci,(t) = -C3,(t) = I ■ ''^^^^'^^ (19) 

for (3/ + r)T < t < (3/ + r + 1)T. We denote this solution as linear parallel code (linPC). Of course, 
other choices are possible, e.g. based on raised cosine (rcPC). 
Another way of defining the correction factor is 

Cl,r(t) = -C3,r(t) = ^^(t - I'T) (20) 

1=1 

for {31 + r)T < t < (3/ + r + 1)T. In that case we take advantage of the natural structure of CPM, 
i.e. in Eq. (fT4l) all except one summands cancel down, similar to the terms with the data symbols. This 
definition has the advantage that we can merge the correction factor and the data symbol in Eq. (H]) and 
we obtain two pseudo alphabets shifted by an offset (offPC) for the first and third transmitting antenna 



Consequently, this -orthogonal design may be seen as three conventional CPM signals with different 
alphabet sets ^Id, fi^^and ^Id.^ for each antenna. In this method, the constant phase offsets introduce 
frequency shifts. But as shown by the simulations in next section, these shifts are quite moderate. 



III. Properties of PC CPM 

A. Decoding 

In this section, for convenience, we use only one receiving antenna but the extension to more than 
one is straightforward. Hence, the optimal receiver for the proposed codes relies on the computation of a 
metric over complete ST blocks followed by a maximum-likelihood sequence estimation (MLSE). That is, 



the CPM STC spans a trellis with piVf' ^ states. For non-orthogonal codes, each state leads to M^' = 
paths with the associated distance evaluated using the norm 

(3Z+3)T g 

Dl= j |l/l,r(t) - ^ai,mSm,r(t)f dt. (21) 
3/T 

Each metric is calculated over a length of LjT = 3T. 

Now, from the -orthogonality between Tx antennas, all crosscorrelations of the symbols sent (in Eq. 
(|2TI) ) are canceled out and we get 

3 3 (3/y)T 

Consequently, we are able to evaluate the metric separately for each symbol and antenna. Therewith the 
number of paths reduces to 3M as the metric on each path results from the sum of 3 partial metrics by 
Eq. (1221). 

The number of paths and states can be further reduced by using some special properties of CPM. There 
exist numerous efficient algorithms for MLSE. However, the efficiency of the detection algorithm is not 
the primary scope of this paper and shall be the subject of another upcoming paper. 



B. Diversity 

The performance of theses codes may be evaluated using the classical pair- wise error probability (PWEP) 
approach [7]. Now, for < t < N^T, let s{t) = [s^ (t) , S2 (t) , ■ ■ ■ ■,Si^{t)Y be the vector continuous-time 
representation [17] of the emitted signals 



Es 



exp (j27r[^™(l) + h^diq{t-{i- 1)T) + c„(t)]). 

i=l 



(23) 



In this representation, the phase memory terms 9m{Ltl + r) have all been included in the summation term. 
Only the initial phase 9mi^) remains. 
We can write s{t) as the product of two matrices 6 and C_{t) and a vector d{t) 



sit) 



Es 
LtT 



QC{t)d{t) 



(24) 



The matrix of initial values = diag(^) and the matrix of correction factors QSt) = diag(c(t)) are 
Lt X Lt diagonal matrices obtained from the vectors 



exp(i27rei(l)) 
exp(j27re2(l)) 



C{t) 



exp(j27rci(t)) 
exp(j27rC2(t)) 

exp{j27rci (t)) 



(25) 



As a result of the parallel mapping, the vector of data symbols can be written as 



d(t) =exp(^j27r/i^rfig(t-(i-l)r)) ■ [1 1 ... l]"^. 

i=l 



(26) 



To prove that our ST-Codes achieve full diversity, it is sufficient to show that their signal matrix 
has full rank [7], [18] 



NcT NcT NcT 

J |Ai{t)|2dt jAiWA^Wdt - jAi{t)Al {t)dt 



NcT NcT NcT 

/A2(t)A*{t)dt / lAaWpdt - /A2(t)A* (t)dt 





NcT NcT NcT 



(27) 



where the normalized vector A(t) gives the difference between the truly sent signals s^{t) and the 
estimated ones s^{t): 



A(t) 



r Ai(i) 1 




r ^iW- 


-liW -| 


A2W 


iLtT 




-hit) 

















(28) 



Now, Zhang and Fitz proved in [7, Prop. 1] that a necessary and sufficient condition for to be full 
rank is to have u'^ A(t) 7^ for all u G C^* unless u = 0. From Eq. (|24)). we have 



u^A(t) = u^QC{t) {d{t)-d{t)). 

By introducing the scalar function 

Nc 
i=l 

exp (j2nh''^^diq(t — {i — 1)T) 



(29) 



i=l 



we get from Eq.(l26l) (parallel mapping) that 

u^^A(t)=e(,,a)Wu"0C:(i)[l 1 ••• 1]"" 
Then, by rewriting Eq. ^ and ^ as Cm(t) = ^c(t). 



if 



u^^A(t) = £(d3)(t) ^<exp ( j27r[^^(l) + 



m — 1 



c(t) 



(30) 



(31) 



(32) 



m=l 



Since for d 7^ d, £:(d,d)(^) 7^ 0' ^ get u^A(t) = it would imply that 



£; «exp(j27r^^(l))) exp(j27r- 



m — 1 



c(t)) = 0. 



771=1 



Introducing the polynomial 



p{x) = J2 «exp(j27r^„(l))) 



(33) 



(34) 



m=l 



Eq. (133]) means that p(eJ27rc(i)/Lt^ ^ q pj-^j^^ ^j^g definition of c{t) (i.e. Eq. (O and (l2Ql)), this would 
imply that the polynomial p{x) of degree Lt — 1 vanishes on more than Lt different points. Thus, p = 
and Um = for all m. Consequently, by [7, Prop. 1], the signal matrix has full rank and all the codes 
(linPC and OffPC) achieve full diversity. 
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Fig. 3. Simulated bit error rate (BER) over a Rayleigh fading ciiannel 

IV. Simulations 

In this section we test the proposed algorithms by running MATLAB simulations. More precisely, we 
benchmark the offPC and linPC codes for two and three Tx antennas. For all simulations we use a Gray- 
coded CPM with a modulation index of 1/2, an alphabet of 2 bits per symbol (M = 4) and a memory 
length 7 of 2. We use a linear phase smoothing function q{t) (2REC) and for the initial phase 9i(l) 
optimal values corresponding to [17] are used. 

The modulated signals are transmitted over a frequency flat Rayleigh fading channel with complex 
additive white Gaussian noise. The fading coefficients an,m are constant for the duration of a code block 
(block fading) and known at the receiver (coherent detection). To guarantee a fair treatment of single and 
multi antenna systems the fading has to have a mean value of one. 

The received signal y„(t) is demodulated by the method introduced in Section IIII-A[ The evaluated 
distances are fed to the Viterbi algorithm (VA), which we use for MLSE. The trellis decoded by the 
Viterbi algorithm has pM'^~^ = 16 states and LtM = 12 paths leaving each state. In our simulations, the 
Viterbi algorithm is truncated to a path memory of 10 code blocks, which means that we get a decoding 
delay of 3 ■ lOT. 

Figure [3] shows the simulations results for one, two and three transmitting antennas. Both, linPC and 
offPC perform equally and corresponding to Section IIII-BI it can be seen that full diversity is achieved. 
For high SNR the BER decreases with 5dB/dec and the three Tx antennas code achieves a decay of some 
3.5dB/dec. 

One of the main reason for using CPM for STC is their spectral efficiency. It should be noticed 
nevertheless that the introduction of a correction factor with non-zero mean in the phase signal induces a 
bandwidth expansion and thus degrades this spectral efficiency. However, as detailed in [17] the additional 
bandwidth requirements are quite moderate. Namely, to achieve an attenuation in power of -30dB for a 
linPC code with 3 Tx antennas, h = 1/2, g = 2 and M = 8, a relative bandwidth expansion of only 7% 
is sufficient. 



V. Conclusion 



In this paper, we introduce a new family of L^-orthogonal STC for three antennas. These systems are 
based on CPM supplemented by correction factors to ensure L^-orthogonality. Structurally the proposed 
code family has full rate and we provide the proof of full diversity. Furthermore, we detail two simple 
representatives of the code family (offPC, linPC), where the offPC offers better performance and a very 
intuitive representation. Finally, it should be noticed that our approach can easily be generalized to any 
number of transmitting antennas with full rate codes that also achieve full diversity. 
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